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Many complex networks are only a part of larger systems, where a number of coexisting topologies 
interact and depend on each other. We introduce a layered model to facilitate the description and 
analysis of such systems. As an example of its application we study the load distribution in three 
real-life transportation systems, where the lower layer is the physical infrastructure and the upper 
layer represents the traffic flows. This layered view allows us to capture the fundamental differences 
between the real load and commonly used load estimators, which explains why these estimators fail 
to approximate the real load. 



PACS numbers: 89.75.Hc, 89.75.Fb, 89.40.Bb, 89.20.Hh 



The topologies of the Internet at the IP layer [J, of 
the World Wide Web 2] or the networks formed by Peer 
To Peer (P2P) applications have recently drawn a lot 
of attention. These graphs have been studied separately, 
as distinct objects. However, they are closely related: 
each WWW or P2P link virtually connects two IP nodes. 
These two IP nodes are usually distant in the underly- 
ing IP topology and the virtual connection is realized as 
a path found by IP routers. In other words, the graph 
formed by an application is mapped on the underlying IP 
network. Although the topologies at both layers might 
share a number of statistical properties (such as a heavy- 
tailed degree distribution), they are very different. 
There exist layers also under the IP layer; even in a sim- 
plified view of the Internet we must distinguish at least 
one - the physical layer. It consists of a mesh of opti- 
cal fibers that are usually put in the ground along roads, 
rails, or power-lines. This results in topologies very dif- 
ferent from those observed at the IP layer. A mapping 
of the IP graph onto the physical layer must satisfy a 
number of constraints (see e.g., 0). 

Another important class of real-life systems is trans- 
portation networks. The graphs based on the physical 
infrastructure of such networks were analyzed on the ex- 
amples of a power grid |j| [(J , railway network Q , road 
networks [ a , B, or urban mass transportation systems 
[ToL ITTl Il2l Although this approach often gives a 

valuable insight into the studied topology, it ignores the 
real-life traffic pattern and hence captures only a part 
of the full picture. Interestingly, the networks of traffic 
flows were studied separately, for instance the flows of 
people within a city 1141. and commuting traffic flows be- 
tween different cities [l5|. These studies, in turn, neglect 
the underlying physical topology. A comprehensive view 
of the system requires to analyze both layers (physical 
and traffic) together. Of course, a partial knowledge of 
the traffic pattern could be introduced into the physical 
graph by assigning weights reflecting the amount of car- 
ried traffic to the physical edges. This describes well a 
specific type of transportation network, where all traffic 
flows are one-hop long and where the two layers actually 
coincide, such as airport networks However, in 

the presence of longer (than one hop) traffic flows, the 



weighted physical graph is not sufficient. For instance, 
the failure of a single physical node/edge should affect 
(delete or cause to reroute) all traffic flows using this 
edge/node, which requires the knowledge of the traffic 
graph and of the actual routes of these flows in the phys- 
ical graph. 

Coexisting and dependent graphs can also be observed 
in social networks |18| , where the same set of nodes may 
be connected in various ways, depending on the type of 
relationship chosen to be represented by edges. These 
graphs are related to each other. It is common, for in- 
stance, to establish a new link in a business relationship 
graph (e.g., to find a job) by performing a search in our 
acquaintanceship network (i.e., by asking our friends who 
ask their friends, etc) 0|. This new direct business link 
translates into a path in the acquaintanceship network. 

The above examples call for the introduction of addi- 
tional layers to the description of some complex systems. 
Therefore we propose a general multi-layered model. We 
explain it on the example of two layers; all the definitions 
naturally extend to any number of layers. In the two- 
layered model, the lower-layer topology is called physical 
graph = (V^,E^), and the upper-layer topology is 
called logical graph G A = (V X ,E X ). We assume that the 
sets of nodes at both layers are identical, i.e., = V x 1 
but as a general rule we keep the indexes cj) and A to make 
the description unambiguous. Let N be the number of 
nodes, A^=| V^| = |l/ A |. The physical and logical graphs 
can be directed or undirected, depending on the applica- 
tion. The nodes and edges can have weights assigned to 
them and denoted by w(-), with w — 1 for unweighted 
graphs. Every logical edge e A = (u A ,t> A ) is mapped on 
the physical graph as a path M(e A ) C connecting the 
nodes and v^, corresponding to u x and v x . (A path is 
defined by the sequence of nodes it traverses.) The set of 
paths corresponding to all logical edges is called mapping 
M(E X ) of the logical topology on the physical topology. 
Now, the load I of a node is the sum of the weights of 
all logical edges whose paths traverse xr: 



e A : v>t>tEM(e x ) 



(1) 



In a transportation network l(v^) is the total amount of 
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FIG. 1: (a) Illustration of the two-layered model. The logical graph G A is mapped onto the physical graph by a 

mapping M(E X ). In this example the logical edge e A is mapped on G^ as the path M(e A ) = (vf ,v^). Assuming 
that G x is unweighted, the loads of the three indicated nodes are l(vf) —3, liv^) — 2 and l(v^) — A. (b) A part of the logi- 
cal and physical graphs of the EU dataset. Here, the traffic intensities (weights) are indicated by multiedges in the logical graph. 



traffic that flows through the node if the logical graph 
is unweighted, l{v^) counts the number of logical edges 
that are mapped on . 

Here, we apply this two-layered framework to study 
transportation networks. The undirected, unweighted 
physical graph will henceforth capture the physical 
infrastructure of a transportation network, and the logi- 
cal graph G x will reflect the undirected traffic flows. All 
data studied in this paper is extracted from timetables 
of public transportation systems. First, we take a list of 
all of trains, metros and buses departing in the system 
within one weekday (time-span of 24 hours). A timetable 
gives the exact route of each vehicle, which translates di- 
rectly into a logical edge e A (connecting the first and the 
last station) and its mapping M(e A ). The number of ve- 
hicles following the same path in both possible directions 
defines the flow intensity - the weight w(e x ) of the logi- 
cal link. (In this context, the logical graph is equivalent 
to a traffic matrix in transportation science |2(j.) We 
describe the algorithm to extract the two layers and the 
mapping from timetables in |21| . 

Dataset I N I \W\ I I |_E A | I # vehicles 

WA - Warsaw 1529 1827 90 324 26075 
CH - Switzerland 1679 1750 142 539 7482 
EU - Europe 6276 7273 181 6623 54073 



TABLE I: The studied datasets. N is the number of nodes, 
\E^\ (respectively, |-B A |) is the number of edges in the 
physical (respectively, logical) graph, and dfi" denotes the 
diameter of the physical graph. The total number of vehicles 
taken into account for every dataset is given by vehicles" . 
Note that \E X \ <C # vehicles, because many vehicles follow 
the same route. 

We study three examples of transportation networks, 
with sizes ranging from city to continent. As an example 
of a city, we take the mass transportation system (buses, 
trams and metros) of Warsaw (WA), Poland. At a coun- 
try level, we study the railway network of Switzerland 



(CH) . Finally we investigate the railway network formed 
by major trains and stations in most countries of central 
Europe (EU) . The basic characteristics of these networks 
can be found in TableHJand in Fig. El Ah physical topolo- 
gies are connected, planar or close to planar, with the 
diameter in the order of O(Vn) (the diameter of a 
graph is the length of the longest of all possible shortest 
paths) , and node degree distributions decaying exponen- 
tially (the degree of a node is the number of edges inci- 
dent on this node). These features are common to many 
physical transportation graphs, such as a road network 
or a railway system. The logical graphs are strikingly dif- 
ferent. They are sparse and have multiple components, 
among which many isolated nodes. The degree distri- 
butions of the logical graphs are right-skewed, meaning 





FIG. 2: EU network (WA and CH yield similar results). Node 
degree distribution in the physical graph (a) , and in the logical 
graph (b); edge weight distribution in the logical graph (c); 
and the distribution of the lengths of traffic flows (d) , counted 
in a number of hops h in the physical graph. 
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a) Node degree 



b) Betweenness 



c) Real load I 






FIG. 3: EU dataset (WA and CH yield similar results). The first three figures present the physical layout of the node degree 
fc^ (a), betweenness b^ (b) and real load I (c). The size of a node is proportional to the measured value. In (d) we present the 
scatter-plots of the node degree (top) and betweenness W (bottom) versus the real load I. In the top left corner of every 
plot we give the value of the corresponding Pearson's correlation coefficient. 



that there is a small number of nodes with very high de- 
gree. This is confirmed by the almost linear shape of the 
distribution in the log-log scale plot shown in Fig. 03; a 
fully linear shape would indicate a power-law (a heavy- 
tailed distribution). Similar right-skewed distributions 
are observed for the weights of edges in the logical graphs 
(Fig. In Fig.|2Ji, we compare the length distribution 
of real traffic flows with the length distribution of all-to- 
all shortest paths. The former is very much left-skewed, 
which means that the real flows tend to be local. 

Knowing the topologies and the mapping of both lay- 
ers, we can easily compute the load of a node with for- 
mula For comparison purposes, we present below 
two load estimators based exclusively on the physical 
graph G*. For load estimators we take two metrics 
known in social networks as centrality measures; they 
are used to assess the importance of nodes. Our first 
metric is node degree fc*\ It seems natural that the nodes 
with a high degree carry more traffic than the less con- 
nected nodes. Our second metric is betweenness b^ |22| . 
The betweenness of a vertex v is the fraction of shortest 
paths between all pairs of vertices in a network, that 
pass through v. If there are more than one shortest 
path between a given pair of vertices, then all such paths 
are taken into account with equal weights summing to 
one. Betweenness aims at capturing the amount of in- 
formation passing through a vertex. Indeed, many au- 
thors take betweenness as a measure of load either di- 
rectly IM |H IH |28l, or with slight modifica- 
tions jaUaHa 

In Fig. [3] we compare the distribution of the real load 
with its two estimators: node degree and betweenness. 
The geographical patterns formed by the three metrics 
differ substantially (see Fig.|3tbc). To quantify these dif- 
ferences, in Fig. |3i we present the scatter plots of these 
two estimators versus the real load I. The correlations 
between them are very low, which is confirmed by low 
values of the corresponding Pearson's coefficients (top 
left corner of every plot). For instance, for the value of 



load I ~ 10 2 , the corresponding values of betweenness b^ 
cover more than two orders of magnitude. Surprisingly, 
contrary to the commonly admitted view, the node de- 
gree approximates the real load better than betweenness. 

Why do load estimators fail to mimic the real load pat- 
tern? Are there some fundamental reasons behind this? 
The layered view of the system is very helpful in answer- 
ing these questions. First, observe that the ways we com- 
pute node degree, betweenness and real load, can be uni- 
fied by recasting the first two in the two-layered setting. 
Indeed, both the node degree and the betweenness can 
be computed as the node load in two-layered systems 
with specific logical topologies mapped on the physical 
graph G^ using shortest paths. We denote these specific 
and G b A ^, for the node degree 



respectively. They are defined 



logical graphs by GA 
and the betweenness I 
as follows. 

In the case of the node degree, pick GA — G^: the log- 
ical graph is identical to the physical graph G*. Hence 
the mapping of Gy on G^ reduces trivially to single hop 
traffic flows, and (JTJ boils down to l(v^) = £^(1^). 
For the betweenness, GA, is an unweighted and complete 
(fully connected) graph. Indeed, the definition of be- 
tweenness requires to find shortest paths between every 
possible pair of vertices. Note that the mapping defined 
by betweenness splits the path (and its weight) if there 
are more than one shortest path, whereas the shortest- 
path mapping simply returns one of them. However, in 
large graphs this difference is negligible, especially if the 
shortest-path algorithm picks one of the possible paths 
at random. 

The same two-layered methodology can therefore be used 
to compute node degree, betweenness and real load. 
Moreover, in all three cases we use the same physi- 
cal graph G^ and a mapping that follows the short- 
est path j3^| . Consequently, all the differences between 
the three metrics are completely captured by the logical 
graphs GA, GA, and G A . We compare them in Table ITU 
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Planar Yes No No 



Weights w(e x ) — 1 =1 Right-skewed 

Degrees k(v ) Exponential = N — 1 Right-skewed 
Edge lengths = 1 Bell-shaped Exponential 

TABLE II: The properties of the logical graphs Gh , G^ and 
G A . "Edge length" is the number of hops in the mapping of 
the edge on the physical graph. 

The graph Gh is moderately dense, planar, unweighted, 
with the degree distribution decaying exponentially. The 
edge length, counted in the number of hops in the map- 
ping of this edge, is equal to one for all edges of GA . In 
contrast, the graph GA is an unweighted and complete 
graph, which means it is very dense, with every node of 
degree equal to N—l. In GA we find both short and long 
edges; their distribution is bell-shaped, as shown by the 
"all-to-all" curve in Fig. Finally, the real-life logical 
graph G x is sparse, weighted and has rather local edges 
(see the "real" curve in Fig. 01). Moreover, the node de- 
gree and edge weight distributions of G A are both very 
much right-skewed. 

There are thus a number of fundamental differences be- 



tween the three logical graphs Gh, GA and G A . They ex- 
plain why the node degree and betweenness fail to mimic 
the real load distribution. We expect to observe similar 
differences in other fields. For instance, the logical graph 
representing the traffic in the Internet shares many prop- 
erties with the logical graphs of transportation systems 
studied here. In particular, in the Internet, the distribu- 
tion of intensity of traffic flows (which is, in this paper, 
equivalent to the edge weights in the logical graph) was 
shown to be heavy-tailed |3ll l8^ | . This is known in the 
field as "the elephants and mice phenomenon" > where 
a small fraction of flows is responsible for carrying most 
of the traffic. Moreover, the number of flows originating 
from a given node (which is, in this paper, equivalent to 
the node degree in the logical graph), was also shown to 
follow a power-law distribution |3l| . 

To summarize, we have introduced a framework for 
studying complex systems in which we distinguish graphs 
on two or more layers. We have shown on the example 
of transportation networks how the layered view can fa- 
cilitate the description, comparison and analysis of such 
systems. Our work represents only a fraction of the possi- 
bilities in this area. For example, the layered perspective 
can completely change our view of the error and attack 
tolerance of considered systems. It would be also inter- 
esting to study how the properties of the topologies at 
different layers affect the interactions between the layers. 
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